Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
k-nearest neighbor data imputation algorithm combined with locality sensitive Hashing
ZHENG Qibin, DIAO Xingchun, CAO Jianjun, ZHOU Xing, XU Yongping
Journal of Computer Applications    2016, 36 (2): 397-401.   DOI: 10.11772/j.issn.1001-9081.2016.02.0397
Abstract555)      PDF (814KB)(969)       Save
k-Nearest Neighbor ( kNN) algorithm is commonly used in data imputation. It is of poor efficiency because of the similarity computation between every tow records. To solve the efficiency problem, an improved kNN data imputation algorithm combined with Locality Sensitive Hashing (LSH) named LSH- kNN was proposed. First, all the complete records were indexed in LSH way. Then corresponding LSH ways for nominal, numeric and mixed-type incomplete data were put forward, and LSH values for all the incomplete records were computed in the proposed way to find candidate similar records. Finally, the incomplete records' real distance to candidate similar records were calculated, and the top- k similar records for kNN imputation were found. The experimental results show that the proposed method LSH- kNN has higher efficiency than traditional kNN as well as keeping almost the same accuracy.
Reference | Related Articles | Metrics